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Abstract 

We exhibit families of 4-CNF formulas over n variables that have sums-of-squares (SOS) proofs 
of unsatisfiability of degree (a.k.a. rank) d but require SOS proofs of size np( d) for values of d = d(n) 
from constant all the way up to n s for some universal constant S. This shows that the running 
time obtained by using the Lasserre semidefinite programming relaxations to find degree-c? SOS 
proofs is optimal up to constant factors in the exponent. We establish this result by combining 
NP-reductions expressible as low-degree SOS derivations with the idea of relativizing CNF formu¬ 
las in [Krajicek’04] and [Dantchev and Riis ’03], and then applying a restriction argument as in 
[Atserias, Muller, and Oliva ’13] and [Atserias, Lauria, and Nordstrom T4], This yields a generic 
method of amplifying SOS degree lower bounds to size lower bounds, and also generalizes the ap¬ 
proach in [ALN14] to obtain size lower bounds for the proof systems resolution, polynomial calculus, 
and Sherali-Adams from lower bounds on width, degree, and rank, respectively. 


1 Introduction 

Let /i,..., f s € R[ati,..., x n \ be real, multivariate polynomials. Then the Positivstellensatz proven 
in [Kri64, Ste73] says (as a special case) that the the system of equations 

/t = 0, = 0 (1.1) 

has no solution over W * 1 if and only if there exist polynomials fj :l , qg <G M [:/; i,..., x n ] such that 

S 

• ( 1 - 2 ) 

i=i e 

That there can exist no solution given an expression of the form (1.2) is clear, but what is more in¬ 
teresting is that there always exists such an expression to certify unsatisfiability. We refer to (1.2) as 
a Positivstellensatz proof or Sums-of-squares (SOS) proof of unsatisfiability, or as an SOS refutation ,* 
of (1.1). We remark that the Positivstellensatz also applies if we add inequalities hi > 0,..., ht > 0 to 
the system of equations and allow terms —hj qj t on the right-hand side in (1.2). 

The degree 2 of an SOS refutation is the maximal degree of any gqfj. The search for proofs of 
constant degree d is automatizable as shown in a sequence of works by Shor [Sho87], Nesterov [NesOO], 
Lasserre [LasOl], and Parrilo [ParOO]. What this means is that if there exists a degree-d SOS refutation 
for a system of polynomial equalities (and inequalities) over n variables, then such a refutation can 
be found in polynomial time n 0( - d K Briefly, one can view (1.2) as linear system of equations in the 

This is the full-length version of the paper with the same title to appear in Proceedings of the 30th Annual Computational 
Complexity Conference (CCC ’15). 

1 All proofs for systems of polynomial equations or for formulas in conjunctive normal form (CNF) in this paper will be 
proofs of unsatisfiability, and we will therefore use the two terms “proof” and “refutation” interchangeably. 

This is sometimes also referred to as the “rank,” but we will stick to the term “degree” in this paper. 
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coefficients of gj and u = Qe with the added constraint that u is a sum of squares, and such a system 
can be solved by semidefinite programming in d/2 rounds of the Lasserre SDP hierarchy. 

In the last few years there has been renewed interest in sums-of-squares in the context of constraint 
satisfaction problems (CSPs) and hardness of approximation, as witnessed by, for instance, [BBH + 12, 
OZ13, Tul09]. These works have highlighted the importance of SOS degree upper bounds for CSP 
approximability, and this is currently a very active area of study. 

Our focus in this paper is not on algorithmic questions, however, but more on sums-of-squares viewed 
as a proof system (also referred to in the literature as Positivstellensatz or Lasserre). This proof system 
was introduced by Grigoriev and Vorobjov [GV01] as an extension of the Nullstellensatz proof system 
studied by Beame et al. [BIK+94], and Grigoriev established SOS degree lower bound for unsatisfiable 
F 2 -linear equations [GriOlb] (also referred to as the 3-XOR problem when each equation involves at 
most 3 variables) and for the knapsack problem [GriOla]. 

Given the connections to semidefinite programming and the Lasserre SDP hierarchy, it is perhaps 
not surprising that most works on SOS lower bounds have focused on the degree measure. However, 
from a proof complexity point of view it is also natural to ask about the minimal size of SOS proofs, 
measured as the number of monomials when all polynomials in each term in (1.2) are expanded out as 
linear combinations of monomials. Such SOS size lower bounds were proven for knapsack in [GHP02] 
and F 2 -linear systems of equations in [KI06], 3 and tree-like size lower bounds for other formulas were 
also obtained in [PS 12]. 

A wider interest in this area of research was awakened when Schoenebeck [Sch08] (essentially) 
rediscovered Grigoriev’s result [GriOlb], which together with further work by Tulsiani [Tul09] led to 
integrality gaps for a number of constraint satisfaction problems. There have also been papers such as 
[BPS07] and [GP14] focusing on semantic versions of the proof system, with less attention to the actual 
syntactic derivation rules used. We refer the reader to, for instance, the introductory section of [OZ13] for 
more background on sums-of-squares and connections to hardness of approximation, and to the survey 
[BS14] for an in-depth discussion of SOS as an approximation algorithm and the intriguing connections 
to the so-called Unique Games Conjecture [Kho02]. 

1.1 Our Contribution 

As discussed above, if a system of polynomial equalities and inqualities over n variables can be shown 
inconsistent by SOS in degree d, then by using semidefinite programming one can find an SOS refutation 
of the system in time n°( d \ It is natural to ask whether this is optimal, or whether there might exist 
“shortcuts” that could lead to SOS refutations more quickly. 

We prove that there are no such shortcuts in general, but that the running time obtained by using the 
Lasserre semidefinite programming relaxations to find SOS proofs is optimal up to the constant in the 
exponent. We show this by constructing formulas on n variables (which can be translated to systems of 
polynomial equalities in a canonical way) that have SOS refutations of degree d but require refutations 
of size . Our lower bound proof works for d from constant all the way up to rd for some constant 6. 

Theorem 1.1 (informal). Let d = d(n) < rd where 5 > 0 is a universal constant. Then there is a 
family of 4-CAT 7 formulas { F n }neN- with 0(n 2 ) clauses over O(n) variables such that F n is refutable 
in sums-of-squares in degree 0(d) but any SOS refutation of F n requires size np( d \ 

This theorem extends an analogous result joint by the two authors with Atserias in [ALN14] for the 
proof systems resolution, polynomial calculus, and Sherali-Adams, 4 * where upper bounds on refutation 
size in terms of width, degree, and rank, respectively, were shown to be tight up to the multiplicative 
constant in the exponent. Theorem 1.1 works for all of these proof systems, since the upper bound is in 
fact on resolution width (i.e., the size of a largest clause in a resolution refutation), not just SOS degree, 

3 It might be worth pointing out that definitions and terminology in this area have suffered from a certain lack of standard¬ 
ization, and so what [KI06] refers to as “static Lovasz-Schrijver calculus” is closer to what we mean by SOS/Lasserre. 

4 The exact details of these proof systems are not important for this discussion, and so we choose not to elaborate further 

here, instead referring the interested reader to [ALN14], 
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and in this sense the theorem subsumes the results in [ALN14]. The concrete bound we obtain for the 
exponent inside the asymptotic notation in the ri Aid) size lower bound is very much worse, however, and 
therefore the gap between upper and lower bounds is very much larger than in [ALN14]. 

We want to emphasize that the size lower bound in Theorem 1.1 holds for SOS proofs of arbitrary 
degree. Thus, going to higher degree (i.e., higher levels of the Lasserre SDP hierarchy) does not help, 
since even arbitrarily large degree cannot yield shorter proofs. This is an interesting parallel to the 
paper [LRST14] exhibiting problems for which a (symmetric) SDP relaxation of arbitrary degree but 
bounded size n d does not do much better than the systematic relaxation of degree d. 

1.2 Techniques 

We obtain the result in Theorem 1.1 as a special case of a more general method of amplifying lower 
bounds on width (in resolution), degree (in polynomial calculus) and rank/degree (in Sherali-Adams and 
Lasserre/SOS) to size lower bounds in the corresponding proof systems. This method is in some sense 
already implicit in [ALN14], which in turn relies heavily on an earlier paper by Atserias et al. [AM013], 
but it turns out that extracting the essential ingredients and making them explicit is helpful for extending 
the results in [ALN14] to an analogue for sums-of-squares. We give a brief, informal description of the 
three main ingredients of the method below. 

(i) Find a base CNF formulas hard with respect to width/degree/rank To start, we need to 
find a base problem, encoded as an unsatisfiable CNF formula, that is “moderately hard” for the proof 
system at hand. What this means is that we should be able to prove asymptotically tight bounds on width 
if we are dealing with resolution, on degree for polynomial calculus, and on degree/rank for Sherali- 
Adams and sums-of-squares. It then follows by a generic argument (as discussed briefly above for SOS) 
that a bound 0(d) on width/degree/rank implies an upper bound n° d,) on proof size. 

In [AMO 13, ALN14] the pigeonhole principle served as the base problem. This principle, which has 
been extensively studied in proof complexity, is encoded in CNF as pigeonhole principle (PHP) formulas 
saying that there is a one-to-one mapping of m pigeons into n pigeonholes for rn > n. For sums-of- 
squares we cannot use PF1P formulas, however, since they are not hard with respect to SOS degree. 
Instead we construct an SOS reduction in low degree from inconsistent systems of F 2 -I incar equations 
to the clique problem, and then appeal to the result in [GriOlb, Sch08] briefly discussed above to obtain 
the following degree lower bound. 

Theorem 1.2 (informal). Given k € N + , there is a graph G and a 3 -CNF formula fc-Clique(G) of size 
polynomial in k with the following properties: 

1. The graph G does not contain a k-clique, but the formula /.’-Clique) C/) claims that it does. 

2. Resolution can refute fc-Clique(G) in width k. 

3. Any sums-of-squares refutation of k-Clique(G) requires degree il(k). 

(ii) Relativize the CNF formulas The second step is to take the formulas for which we have estab¬ 
lished width/degree/rank lower bounds and relativize them. Relativization is an idea that seems to have 
been considered for the first time in the context of proof complexity by Krajicek [Kra04] and that was 
further developed by Dantchev and Riis [DR03]. Very loosely, it can be described as follows. 

Suppose that we have a CNF formula encoding (the negation of) a combinatorial principle saying 
that some set S has a property. For instance, the CNF formula could encode the pigeonhole principle 
discussed above, or could claim the existence of a totally ordered set of n elements where no element in 
the set is minimal with respect to the ordering (these latter CNF formulas are known as ordering principle 
formulas, least number principle formulas, or graph tautologies in the literature). 

The formula at hand is then relativized by constructing another formula encoding that there is a (po¬ 
tentially much larger) set T containing a subset S CT for which the same combinatorial principle holds. 
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For the ordering principle, we can encode that there exists a non-empty ordered subset S C T of arbitrary 
size such that it is possible for all elements in S to find a smaller element inside S. This relativization 
step transforms the previously very easy ordering principle formulas into relativized versions that are 
exponentially hard for resolution [Dan06, DM14]. For the PF1P formulas, we specify that we have a set 
of M S> m pigeons mapped into into n < m holes such that there exists a subset of m pigeons that are 
mapped injectively. 

In our setting, it will be important that the relativization does not make the formulas too hard. We do 
not want the hardness to blow up exponentially and instead would like the upper bound obtained in the 
first step above to scale nicely with the size of the relativization. For our general approach to work, we 
therefore need formulas talking about some domain being mapped to some range, where we can enlarge 
the domain while keeping the range fixed, and where in addition the mapping is symmetric in the sense 
that permuting the domain does not change the formula. 

For this reason, relativizing the ordering principle formulas does not work for our purposes. Pigeon¬ 
hole principle formulas have this structure, however, which is exactly why the proofs in [ALN14] go 
through. As already mentioned, PHP formulas will not work for sums-of-squares, but we can relativize 
the formulas in Theorem 1.2 by saying that there is a large subset of vertices such that there is a fc-clique 
hiding inside such a subset. 

(iii) Apply random restrictions to show proof size lower bounds In the final step, we use 
random restrictions to establish lower bounds on proof size for the relativized CNF formulas obtained in 
the second step. This part of the proof is relatively standard, except for a crucial twist in the restriction 
argument introduced in [AMO 13]. 

Assume that there is a small refutation in sums-of-squares (or whatever proof system we are studying) 
of the relativized formula claiming the existence of a subset of size m <C M with the given combinatorial 
property. Now hit the formula (and the refutation) with a random restriction that in effect chooses a subset 
of size m, and hence gives us back the original, non-relativized formula. This restriction will be fairly 
aggressive in terms of the number of variables set to fixed truth values, and hence it will hold with high 
probability that the restricted refutation has no monomials of high degree (or, for resolution, no clauses 
of high width), since all such monomials will either have been killed by the restriction or at least have 
shrunk significantly. (We remark that making use of this shrinking in the analysis is the crucial extra 
feature added in [AMO 13].) But this means that we have a refutation of the original formula in degree 
smaller than the lower bound established in the first step. Hence, no small refutation can exist, and the 
lower bound on proof size follows. 

This concludes the overview of our method to amplify lower bounds on width/degree/rank to size. 
It is our hope that developing such a systematic approach for deriving this kind of lower bounds, and 
making explicit what conditions are needed for this approach to work, can also be useful in other contexts. 

1.3 Organization of This Paper 

The rest of this paper is organized as follows. We start in Section 2 by reviewing the definitions and 
notation used, and also stating some basic facts that we will need. In Section 3, we prove a degree 
lower bound for CNF formulas encoding a version of the clique problem. We then present in Section 4 a 
general method for obtaining SOS size lower bounds from degree lower bounds (or from width, degree, 
and rank, respectively, for proof systems such as resolution, polynomial calculus, and Sherali-Adams). 
We conclude with a brief discussion of some possible directions for future research in Section 5. 

2 Preliminaries 

For a positive integer n, we use the standard notation [n] = {1,2,..., n }. All logarithms in this paper 
arc to base 2. A CNF formula F is a conjunction of clauses, denoted F = /\-Cj, where each clause C is 
a disjunction of literals, denoted C = \J i a, . Each literal a is either a propositional variable x (a positive 
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literal) or its negation x (a negative literal). We think of formulas and clauses as sets, so that there is no 
repetition and order does not matter. We consider polynomials on the same propositional variables, with 
the convention that, as an algebraic variable, x evaluates to 1 when it is true and to 0 when it is false. 
All polynomials in this paper are evaluated on 0/1-assignments, and live in the ring of real multilinear 
polynomials, which is the ring of real polynomials modulo the ideal generated by polynomials xf — ay 
for all variables ay. In other words, all variables in all monomials have degree at most one, and monomial 
multiplication is defined by (n, GA ®i) ' (Flies = UieAuB x i- 

Since sums-of-squares derivations operate with polynomial equations and inequalities, in order to 
reason about CNF formulas we need to encode them in this language. For a clause C = C + V C~, where 
we write C + and C~ to denote the subsets of positive and negative literals, respectively, we define 


5(C) = £ x + E t 1 -*) 
xec+ xec~ 


( 2 . 1 ) 


and encode C as the inequality 


5(C) > 1 . 


( 2 . 2 ) 


Clearly, a clause C is satisfied by a 0/1-assignment if and only if the same assignment satisfies the 
inequality 5(C) > 1. For a variable x and a bit j3 £ {0,1}, we define 


1 — x if f3 = 0, 
x if /3 = 1; 


(2.3) 


and for a sequence of variables x = (ay 1; . .. x hi: ) and a binary string 8 = (8\ ,... 8 W ), we define the 
indicator polynomial 

W 

S X=P = IJ Sxi^Pj (2-4) 

3=1 

expanded out as a linear combination of monomials. That is, 6g=p is the polynomial that evaluates 
to 1 for 0/1-assignments satisfying the equalities ay. = 8j for j = 1..... r/; and to 0 for all other 
0/1-assignments. We have the following useful fact. 


Fact 2.1. For every sequence of variables x the syntactic equality (F0y G {o i}» $x=f}) = 1 holds (after 
cancellation of terms). 


Let F be a CNF formula over some set of variables denoted as Vars(F), and let p be a partial 
assignment on Vars(F). We write F\ p to denote the formula F restricted by p, where all clauses C G F 
satisfied by p are removed and all literals falsified by p in other clauses are removed. For a polynomial p 
over variables Vars(F) (written, as always, as a linear combination of distinct monomials), we let p\ p 
denote the polynomial obtained by substituting values for assigned variables and removing monomials 
that evaluate to 0. We extend this definition to sets of formulas or polynomials in the obvious way by 
taking unions. 


Definition 2.2 (Sums-of-squares proof system). A sums-of-squares derivation, or SOS derivation for 
short, of the polynomial inequality p > 0 from the system of polynomial constraints 


/i = 0,..., f s = 0, hi>0,...,ht>0 


(2.5) 


P = 9jfj + VC + “0 , (2.6) 

3=1 3=1 

where g\,.... g s are arbitrary polynomials and each Uj is expressible as a sums of squares qj f . 
A derivation of the equation p = 0 is a pair of derivations of p > 0 and —p > 0. A sums-of-squares 
refutation of (2.5) is a derivation of the inequality — 1 > 0 from (2.5). 
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The degree of an SOS derivation is the maximum degree among all the polynomials gjfj, Ujhj, 
and uq in (2.6). The size of an SOS derivation is the total number of monomials (counted with repetition) 
in all polynomials gjfj, Ujhj, and uq (all expanded out as linear combinations of distinct monomials). 
The size and degree of refuting an unsatisfiable system of polynomial constraints are defined by taking 
the minimum over all SOS refutations of the system with respect to the corresponding measure. 

Remark 2.3. Readers more familiar with the usual definition of Positivstellensatz/sums-of-squares in 
the literature might be a bit puzzled by the use of multilinearity in Definition 2.2, and might also wonder 
where the axioms x\ — Xi = 0, x t > 0, and 1 — x r > 0 for every variable x, disappeared. It is important 
to note that we have these axioms in our multilinear setting as well, although they are not explicitly 
mentioned. Equations of the form xf — X{ = 0 are tautological due to multilinearity, and the inequalities 
Xi > 0 and 1 — xy > 0 are derivable by the squaring rule since in the multilinear' setting we have x h = xf 
and 1 — Xi = (1 — Xi) 2 . 

Our choice of the multilinear setting is without any loss of generality and only serves to simplify 
the technical arguments slightly. It is easy to see that applying the multilinearization operator mapping 
x\ to Xi for every £ > 1 to any SOS derivation over real polynomials yields a legal SOS derivation 
over multilinear real polynomials in at most the same size and degree. Thus, working in the multilinear 
setting can only make our lower bounds stronger. As to the upper bounds in this paper, we prove them 
in the resolution proof system discussed below, and the simulation of resolution by sums-of-squares in 
Lemma 2.6 below works also in the standard setting without multilinearization. 

Let us state some useful basic properties of multilinear polynomials for later reference (and also 
provide a proof just for completeness). 

Proposition 2.4 (Unique multilinear representation). Every function f : {0,1}" —>• M has a unique 
representation as a multilinear polynomial. In particular, if p is a multilinear polynomial such that 
p(a) € {0,1 } for all a € {0, l} n , then for every positive integer I the equality pr = p holds (where 
this is a syntactic equality of multlinear polynomials expanded out as linear combinations of distinct 
monomials). 

Proof. The set of functions from {0, l} 71 to M is a vector space of dimension 2 n . Any function f(x) in 
this space can be represented as a linear combination X^/3e{o i}" f(P) ' T?= a(•'?)■ Since each 8g=p is a 
multilinear polynomial the multilinear monomials on n variables are a set of 2" generators of the vector 
space. By linear independence they also form a basis, and hence the representation of a function as a 
lineal - combination of multilinear monomials is unique. The second part of the proposition now follows 
immediately since pr and p compute the same function. □ 

The upper bounds in this paper are shown in the weaker proof system resolution, which is defined 
as follows. A resolution derivation of a clause D from a CNL formula F is a sequence of clauses 
(D\, I)‘ 2 ■..., D t ) such that I) T = D and for every clause I), it holds that it is either a clause of F 
(an axiom), or is obtained by weakening from some Dj C I), for j < i, or can be inferred from two 
clauses D(,Dj, £ < j < i, by the resolution rule that allows to derive the clause A V B from two 
clauses AVi and B V x (where we say that A V x and B V x arc resolved on x to yield the resolvent 
A V B). If in a resolution derivation (I)\. B>i■ ■ ■ ■, D T ) each clause Dj is only used once in a weakening 
or resolution step to derive some Di for i > j, we say that the derivation is tree-like (such derivations 
may contain multiple copies of the same clause). A resolution refutation of F, or resolution proof for F, 
is a derivation of the empty clause (the clause containing no literals) from F. 

The width of a clause is the number of literals in it, and the width of a CNL formula or resolution 
derivation is the maximal width of any clause in the formula or derivation. The size of a resolution 
derivation is the total number of clauses in it (counted with repetitions). The size and width of refuting 
an unsatisfiable CNL formula F is defined by taking the minimum over all resolution refutations of F 
with respect to the corresponding measure. 

The following standard fact is easy to establish by forward induction over resolution derivations. We 
omit the proof. 
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Fact 2.5. Consider a partial assignment p which assigns £ variables. Let A be the unique clause of 
width £ such that A evaluates to false under p. If resolution can derive C in width w and size Sfrom F\ p , 
then resolution can derive A\J C in width at most w + (' and size at most S + 1 from F. 

Let us also state for the record the formal claim that SOS is more powerful than resolution in term 
of degree (and for constant degree also in terms of size). The next lemma is essentially Lemma 4.6 
in [ALN14], except that there the lemma is stated for the Sherali-Adams proof system. Since SOS 
simulates Sherali-Adams efficiently with respect to both size and degree, however, the same bounds apply 
also for SOS. Referring to the discussion in Remark 2.3, it should also be pointed out that the lemma 
in [ALN14] is proven in the more common non-multilinear setting with explicit axioms xf — Xi = 0, 
Xi > 0, and 1 — Xi > 0 for all variables ay. 

Lemma 2.6 (SOS simulation of resolution). If a CNF formula F = f\ j = \ Cj has a resolution refutation 
of size S and width w, then the constraints { S(Cj ) > 1 }*■ _, as defined in (2.1) and (2.2) have an SOS 
refutation of size 0( w2 w S) and degree at most w + 1. 

The next lemma will be useful as a subroutine when we prove upper bounds in resolution. 

Lemma 2.7. Let k and mi, m 2 , ■ ■ ■ m k be positive numbers. Then the CNF formula consisting of the 
clauses 


Vi, 0 


i e [k], 

(2.7a) 

Vi, j-1 V Xij 

v yip 

i € [k], j € [m^, 

(2.7b) 

Vi^rrii 


i € [A;], 

(2.7c) 

x Lh V X2 ,h 

''' V Xkj k 

(j 1 , • • -,jk) £ [mi] x • • • x [m k \, 

(2.7d) 


has a resolution refutation of width k + 1 and size 0(]~[[ = i m i)- 

Proof. We prove the lemma by backwards induction over k. Consider any clause A of the form 

A = x lih V x 2 , h • • • V %_i) (2.8) 

for 1 < i < k (and note that for i — 1 this is the empty clause). We will show how to derive A in width 
i + 1 given clauses A V x h i, A V 2 ^ 2 , ..., A V Xi tTni . 

We start by resolving the axioms and Ij, 0 V V , and then we apply the resolution rule 
again on this resolvent and the clause A V xyi (available by the induction hypothesis) to get A V yip. 
We now deduce A V y t> j for increasing j. Suppose we have already obtained A V Vij-i- Using the 
inductively derived clause A V x^j and the axiom y % l _ { V Xij V y,. ? , we can resolve on variables y % .j-\ 
and X{ : j to obtain A V y l .j. Once A V 'ip.m, has been derived, we resolve it with the axiom y i m . to get A. 
By backward induction we reach the empty clause for i = 1, which concludes the resolution refutation. 
Since i < k. the refutation has width k +1. It is easy to verify that all axioms and intermediate clauses 
in the refutation are used exactly once. Thus, the refutation is tree-like, and has size exactly twice the 
number of axioms clauses minus one, which, in particular, is tnA. □ 

When we construct formulas to be relativized as described in Section 1.2, it is convenient to use 
variables x t j, where i ranges over some specific domain D and J is a collection of other indices. We 
say that the variable x t j mentions the element i € D. The domain-width of a clause is the number of 
distinct elements of D mentioned by its variables. The domain-width of a CNF formula or resolution 
proof is defined by taking the maximum domain-width over all its clauses, and the domain-width of 
refuting a CNF formula F is the minimal domain-width of any resolution refutation of F. Similarly, the 
domain-degree of a monomial is the number of distinct elements in D mentioned by its variables, the 
domain-degree of a polynomial or SOS proof is the maximal domain-degree of any monomial in it, and 
the domain-degree of refuting an unsatisfiable system of polynomial constraints is defined by taking the 
minimum over all refutations. 
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3 A Degree Lower Bound for Clique Formulas 

In this section we state and prove the formal version of Theorem 1.2, namely a lower bound for the 
domain-degree needed in SOS to prove that a graph G has no /.’-clique. Let us start by describing how 
we encode the A-clique problem as a CNF formula. 

Definition 3.1 (A-clique formula). Let A be a positive integer, G = ( V. E) be an undirected graph 
on N vertices, and {v\,V 2 , ■ ■ ■, vn) be an enumeration of V(G) = V. Then the formula A-Clique(G') 
consists of the clauses 


%i,lL V 

i, i' £ [A], i yA i', {u, v} ^ E(G), 

(3.1a) 

X%fu, V Xi,v 5 

i £ [A], u, v £ V ( G ), u yA v. 

(3.1b) 

Zi t 0 

i € [A], 

(3.1c) 

%h{j— 1) ^ %i,Vj V %i,j 

* e [A], j £ [iV], 

(3. Id) 

%i,N 

i £ [A]. 

(3-le) 


The formula A-Clique(G) encodes the claim that G has a clique of size k. The intended meaning 
of the variable x l)V for v £ V(G) is that v is the ith vertex of the clique. The clauses in (3.1a) enforce 
that any two members of the clique are distinct and are connected by an edge. The clauses in (3.1b) 
enforce that at most one vertex is chosen for each i £ [k\. The clauses in (3.1c)-(3.1e) are simply the 
3-CNF encoding (using extension variables) of the clause VyLi x i,vj enforcing that at least one vertex 
is chosen for each i £ [k]. The variables of A-Clique(G') are indexed by i over the domain [k] and 
the domain-width of the formula is 2. The next proposition shows that the naive brute-force approach to 
decide A-Clique(G) can be carried on in resolution (and hence by Lemma 2.6 also in SOS). 

Proposition 3.2. If G has no clique of size k, then A-Clique(Cr) has a resolution refutation of size 
0(|F| fc ) and width k + 1. 

Proof We first use the weakening rule to derive all clauses of the form 

%i,ui V 'E2,U2 v ' • • V Xh^ Uk (3.2) 

for every sequence of vertices («i, uo ,... , up.). This is possible since either the sequence contains a 
repetition or it includes two vertices with no edge between them, and in both cases this means that the 
clause (3.2) is a superclause of some clause of the form (3.1a). Then we derive the empty clause by 
applying Lemma 2.7 to the clauses (3.1c)-(3.1e) and (3.2). □ 

In order to obtain suitably hard instances of A-Clique (G) we construct a reduction from 3-XORs 
to A;-partite graphs. It is convenient for us to describe the special case of A-clique on A-partite graphs 
directly as an encoding as polynomial equations and inequalities as follows next. 

Definition 3.3 (Polynomial encoding of A-clique on A-partite graphs). For a A-partite graph G with 
V{G) = V\ U Vi U • • • U14 we let A-Block(G) denotes the following collection of polynomial constraints: 


Y.x v = i 

i £ [A], 

(3.3a) 

ve Vi 



x u + x v <1 

u £ Vi,v £ Vi', i f %, {rq u} 0 E(G). 

(3.3b) 


It is straightforward to verify that these constrants encode the claim that G has a clique with one 
element in each block V % , since exactly one element is chosen from each block by (3.3a) and all the 
chosen elements have to be pairwise connected by (3.3b). 

Any lower bound on degree that we establish for A-Block(G) will hold also for A-Clique(G) as 
stated in the following proposition. 


3 A Degree Lower Bound for Clique Formulas 


Proposition 3.4. Consider a k-partite graph G, where V (G) = V\ U V 2 U • ■ ■ U 14 . 7/'fc-Clique(G) has 
an SOS refutation in domain-degree d, then Ac-Block(G) has an SOS refutation in domain-degree d. 

Proof The proof is by transforming a refutation of fc-Clique(G) into a refutation of fc-Block(G) of 
the same domain-degree. To give an overview, we start with a refutation of fe-Clique(G) of domain- 
degree d and replace its variables with polynomials of degree at most 1 mentioning only variables from 
fc-Block(G). In this way we get an SOS refutation of domain-degree at most d from the substituted 
axioms of /c-Clique(G). The latter polynomials are not necessarily axioms of /c-Block(G), but we show 
that they have SOS derivations of domain-degree 1 from the axioms of A;-Block(6'). This concludes the 
proof. 

The variable substitution has two steps: first we substitute every variable z- hl with the linear form 
^2tLj +1 x i,vt < where { Vj } ; v = , is the enumeration of V(G) in Definition 3.1, and then we set x: l/Vj to 0 
whenever Vj f V r . 

As mentioned above, we now need to give SOS derivations of domain-degree 1 of all transformed 
axioms in fc-Clique(G) from fc-Block(G). For the axioms (3.1c)-(3.1e), the SOS encoding is 


Zi, 0 > 1 

i € [k], 

(3.4a) 

(l ~ z i,(j-l)) + Xi, Vj + Zij > 1 

i € [k\,j £ [iV], 

(3.4b) 

(1 — Z^n) > 1 

i £ [k]. 

(3.4c) 


After the first step of the substitution the inequalities (3.4a), (3.4b) and (3.4c) become, respectively, 
the inequality x i,vj — 1’ an ^ two occurrences of tautology 1 > 1. Furthermore, after the second 

step of the substitution the inequality (3.4a) becomes Ylv&v Xi x — which is subsumed by Equa¬ 
tion (3.3a). Each of the axioms (3.1a) and (3.1b) is encoded as 

1 tEiqv A 0 (3*5) 

for some pair of indices i. i' and vertices u. v. We assume that a £ V, and v £ Vy, because otherwise 
the variable substitution turns the inequality into either a tautology or into 1 — x l)U > 0 , where the 
latter follows from (1 — x l)U )‘ 2 > 0 by multilinearity. If i f i' then the inequality (3.5) is an axiom of 
fc-Block(G'). If that is not the case, then we can obtain 1 — Xi tU — Xi iV in domain-degree 1 using the 
derivation 

1 ^ ' x i,w 4~ ^ ^ {. x i,u 1 ) 

v€Vi w£{u,v} 

from Equation (3.3a) sum of squares 

where the first identity holds by multilinearity. The proposition follows. □ 

What we want to do now is to prove a domain-degree lower bound for instances of fc-Block(G) 
where the graph G is obtained by a reduction from (unsatisfiable) sets of F 2 -linear equations. We rely 
on the version of Grigoriev’s degree lower bound [GriOlb] shown by Schoenebeck [Sch08], which is 
conveniently stated for random 3-XOR formulas as encoded next. 

Definition 3.5 (Polynomial encoding of random 3-XOR). A random 3-XOR formula f represents a 
system of An linear equations modulo 2 defined over n variables. Each equation is sampled at random 
among all equations of the form x ® y @ z = b as follows: x, y, z are sampled uniformity without 
replacement from the set of n variables and b is sampled uniformly in {0,1}. The polynomial encoding 
of any such linear equation modulo 2 is 


(1 - x){l -y)z = 0 

(3.7a) 

O 

II 

l 

r—1 

l-T 

i 

T— 1 

(3.7b) 

x{l - y)( 1 -z) = 0 

(3.7c) 

xyz = 0 

(3.7d) 


= 1 - 


X 

veVi 


%i,w + x %i,w 

w^{u,v} 


— 1 %i,u 


(3.6) 
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when 6 = 0 and 


(1 - x)(l - y){ 1 -z) = 0 


(3.7e) 

(3-7f) 

(3.7g) 

(3.7h) 


xy( 1 — z) = 0 
x(l — y)z = 0 
(1 — x)yz = 0 


when 6 = 1 . 


Fixing 6 = 1/4 and A = 8 in [Sch08] we have the following theorem. 

Theorem 3.6 ([Sch08]). There exists an a, 0 < a < 1, such that for every e > 0 there exists an 
n e £ N such that a random 3-XOR formula f in n > n e variables and 8 n constraints has the following 
properties with probability at least 1 — e. 

1. At most 6 n parity constraints off can be simultaneously satisfied. 

2. Any sums-of-squares refutation of 0 requires degree an. 

Now we arc ready to describe how to transform a 3-XOR formula 0 into a A-partite graph G k that 
has a clique of size k if and only if 0 is satisfiable. 

Definition 3.7 (3-XOR graph). Given k £ N and a 3-XOR formula 0 with 8 n constraints over n vari¬ 
ables, where we assume for simplicity that k divides 8 n, we construct a 3-XOR graph G/ as follows. 

We arbitrarily split the formula 0 into k linear systems with 8 n/k constraints each, denoted as 
fi , 02 ,... 0k- For each 0) t we let V, be a set of at most N < 2 24n,//i ' vertices labelled by all possible 
assignments to the at most 24 n/k variables appearing in 0> % . For two distinct vertices u £ Vi and v £ If 
there is an edge between u and v in G/ if the two assignments corresponding to u and v are compatible, 
i.e., when they assign the same values to the common variables, and also the union of the two assign¬ 
ments does not violate any constraint in f. (In particular, each V is an independent set, since two distinct 
assignments to the same set of variables are not compatible.) 

The key property of the reduction in Definition 3.7 is that it allows small domain-degree refutations 
of 6 -Block (G^) to be converted into small degree refutations of 0. 

Lemma 3.8. If 6 -Block (G^) has an SOS refutation of domain-degree d, then f has an SOS refutation 
of degree 2Adn/k. 

Proof. Again we start by giving an overview of the proof, which works by transforming a refutation of 
6 -Block (G^) of domain-degree d into a refutation of 0 of degree 24 dn/k. 

Given a refutation of 6 -Block (G^) of domain-degree d, we replace every variable x v with a polyno¬ 
mial over the variables of 0. In this way we get an SOS refutation from the polynomials corresponding 
to the substituted axioms of 6 -Block (G^). The latter polynomials need not be axioms of f, but we show 
that they can be efficiently derived in SOS from <f>. We thus obtain an SOS refutation of f, the degree of 
which is easily verified to be as in the statement of the lemma. 

We now describe the substitution in detail. Consider a block Vi and suppose that the corresponding 
3-XOR formula </>,; mentions t variables. Let us write x to denote this set of variables. Then every 
vertex v £ Vi represents an assignment /3 £ {0,1}* to x. In what follows, we denote the indicator 
polynomial 5g = p in (2.4) by S v for brevity, and we substitute for each variable x v the polynomial S v of 
degree t < 24 n / k. 

Before the substitution each monomial in the original refutation has domain-degree at most d by 
assumption. Two important observations are that (d v ) 2 = 5 V for every v £ Vi and that 5 U 5 V = 0 for 
every two distinct u, v in the same block V,. Therefore, after the substitution each monomial is either 
identically zero or the product of at most d indicator polynomials, and hence its degree is at most 2Adn/k. 
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To verify these observations, note that the identity (d v ) 2 = S v holds by Proposition 2.4. The equality 
S U S V = 0 holds because 5 U and 5 V are the indicator polynomials of two incompatible assignments, and so 
their product always evaluates to zero. Applying Proposition 2.4 again, we conclude that the (multilinear) 
polynomial 5 U 5 V is identically zero. 

In order to complete the proof outline above, we now need to present SOS derivations starting from 
the 3-XOR constraints of d> of all polynomial constraints resulting from the substitutions in the axioms 
of fc-Block(G^) described above, and to do so in degree at most 24n/k. 

Let us first look at the axioms (3.3a). By Fact 2.1, the identity 

^ 5 V = S s=P = 1 (3-8) 

veVi / 3 g { o,i }* 

holds syntactically, so substitutions in axioms of the form (3.3a) result in tautologies 1 = 1. 

The remaining axioms of fc-Block(G^) in (3.3b) have the form x u + x v < 1 for non-edges (u. v) 
between vertices in different blocks. By construction of G l ' 0 the reason u and v are not connected is 
either that the partial assignments corresponding to the two vertices are incompatible, or that their union 
violates some constraint in f. 

In the first case, 1 — 5 U — 5 V > 0 is an SOS axiom because of the identity 

(1 - S u - 5 V ) 2 = 1 -5 U -5 V , (3.9) 

which follows from the observation that S u and 5 V are the indicator polynomials of two incompatible 
assignments and cannot evaluate to 1 simultaneously, and so (1 — S u — S v ) evaluates to either 0 or 1 and 
is identical to its square by Proposition 2.4. The degree of (3.9) is 24 n/k. 

In the second case, the two assignments corresponding to u and v are compatible but their union 
violates some initial equation / = 0 of the form (3.7a)-(3.7h). Any such / is a degree-3 indicator 
polynomial which evaluates to 1 whenever the assignment satisfies the equations 6 U 5 V = 1. This means 
that S U S V contains / as a factor. We factorize / as so that S u = f u S' u and 5 V = f v S' v . Given this 
notation, we can derive 0 < 1 — S u — S v using the indentity 

(1 - fu - fv) 2 + (.fu - s u ) 2 + (/„ - 5 V ) 2 - 2 f u f v = 1 -5 U -5 V (3.10) 

of degree at most 24 n/k. To verify (3.10), observe that the left-hand side is the sum of some squared 
polynomials and —2 f u f v = —2/ = 0. Expanding the squared polynomials and using Proposition 2.4 
repeatedly we have that ( f u ) 2 = f u , (fv) 2 = fv, (S u ) 2 = S u , and (<5 „) 2 = S v , from which we also 
conclude that 

fuSu = fu(fuS'u) = ( fu) 2 S' u = fuS’u = Su (3.11) 

and 

fvS v = fv (fvS'u) = ( fv) 2 S' v = fvS'u = S v (3.12) 

which establishes that (3.10) holds. The lemma follows. □ 

Now we can put together all the material in this section to prove a formal version of Theorem 1.2 as 
stated next. 

Theorem 3.9. There are universal constants Nq £ N + and ctQ, 0 < gq < 1, such that for every k > 1 
there exists a graph Cf . with at most kN$ = ()(/,:) vertices and a 3-CNF formula A:-Clique(G'/,.) of size 
polynomial in k with the following properties: 

1. Resolution can refute fc-Clique(Gfc) in size 2 °( fclogfc ) and width k + 1. 

2. Any SOS refutation of k -Clique (Gk) requires domain-degree o:o k. 
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Proof. Fix any positive e < 1 and let Ay = 2 24n % oy = A and n = kn e , where n e and a are the 
universal constants from Theorem 3.6. To build the graph Gk we take a 3-XOR formula (P on n variables 
and 8 n equations from the distribution in Definition 3.5. Since n > n e , Theorem 3.6 implies that there is 
a formula in the support of the distribution that is unsatisfiable and that requires degree an to be refuted 
in SOS. We fix f to be that formula and let Gk be the graph G k . constructed as in Definition 3.7. Then G k 
is A;-partite, with each part having at most 2 Un / k = Nq vertices, and the graph has no /.--clique because 
otherwise f would be satisfiable. 

Suppose that there is an SOS refutation of A’-Clique(G^) of domain-degree d. We want to argue that 
d > a ok. Since G k is /c-partite, by Proposition 3.4 the formula A:-Block (G k f also has an SOS refutation 
in domain-degree d. By Lemma 3.8, this in turn yields an SOS refutation of (P in degree 24dn/k. Now 
Theorem 3.6 implies that 24dn/k > an, and hence d > k^k = a^k. 

To conclude the proof, we can just observe that the resolution width and size upper bounds are a 
direct application of Proposition 3.2. □ 

4 Size Lower Bounds from Relativization 

Using the material developed in Section 3, we can now describe how to relativize formulas in order to 
to amplify degree lower bounds to size lower bounds in SOS . This method works for formulas that are 
“symmetric” in a certain sense, and so we start by explaining exactly what is meant by this. 

Definition 4.1 (Symmetric formula). Consider a CNF formula F on variables where i is an 
index in some domain D and j denotes a collection of other indices. For every subset of indices 
T = { / 1 , G ...., A,} C D we identify the subformula F? of F such that each clause C G F? mentions 
exactly the indices in i, so that a formula F of domain-width d can be written as 

d 

F = A A • (4-1) 

s=0 iCD 

\l\ = S 

We say that F is symmetric with respect to D if it is invariant with respect to permutations of D, i.e., if 
for every F? C F it also holds that F n ( F, where tt is any permutation on D and n (i ) is the set of 
images of the indices in i. Phrased differently, F is symmetric with respect to D if for any permutation 7r 
on D the syntactic equality F = Aren Ar(r) holds (where we recall that we treat CNF formulas as sets 
of clauses). We apply this terminology for systems of polynomial equations and inequalities in the same 
way. 

Let us illustrate Definition 4.1 by giving perhaps the most canonical example of a formula that is 
symmetric in this sense. 

Example 4.2. Recall that the CNF encoding of the pigeonhole principle with a set of pigeons D and 
holes [n] claims that there is a mapping from pigeons in IP) to holes such that no hole gets two pigeons. 
For every pigeon i G D there is a clause Vye [tv] x i,j ar| 4 for every two distinct pigeons i. i' and hole j 
there is a clause Xij V xyj. Since any permutation of the set of pigeons D gives us back exactly the 
same set of clauses (only listed in a different order) the pigeonhole principle formula is symmetric with 
respect to D. 

By now, the reader will already have guessed that another example of a symmetric formula, which 
will be more interesting to us in the currect context, is the /.’-clique formula discussed in Section 3. 

Observation 4.3. The k -Clique) G) jonn 11 la in Definition 3.1 over variables x tJ! is symmetric with re¬ 
spect to the indices i G [A;]. 
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Starting with any formula F symmetric with respect to a domain D, we can build a family of similar 
formulas by varying the size of the domain. If F has domain-width d, then for each s, 0 < s < d, the 
subformulas Fj with |f| = s in (4.1) are the same up to renaming of the domain indices in i. Hence, we 
can arbitrarily pick one such subformula to represent them all, and denote it as F s . The formulas {F s }^ =0 
are completely determined by F, and together with D they in turn completely determine F. Using this 
observation, we can generalize the formula F over domain D to any domain D' with \D'\ > d by 
defining F[D'] to be the formula 

d 

FID'} = A A Fr , (4.2) 

s=0 iC-D 

\l\ = S 

where each Eg for |fj = s is an isomorphic copy of F s with its domain indices renamed according to i. 
Let us state some simple but useful facts that can be read off directly from (4.2): 

1. For any formula F of domain-width d symmetric with respect to domain D, it holds that F[D] is 
(syntactically) equal to F. 

2. For any domains D',D" with \D'\ = I)” > d, the two formulas F\D’] and F\D"] are isomor¬ 
phic. 

3. For any D" D D' with \D'\ > d, the formula F[D”] contains many isomorphic copies of F\D']. 

When we want to emphasize the domain D of a formula F in what follows, we will denote the 
formula F as F[D]. When the domain is D = [i], we abuse notation slightly and write F[t] instead 
of F[[f]]. As discussed above, from a symmetric formula F of domain-width d we can obtain a well- 
defined sequence of formulas F[t] for all t > d. We say that the unsatisfiability threshold of such a 
sequence of formulas is the least t such that F[t] is unsatisfiable. For instance, the pigeonhole principle 
formula in Example 4.2 has unsatisfiability threshold n + 1. 

4.1 Relativization of Symmetric Formulas 

Given a formula F = F[m\ symmetric with respect to [m] and a parameter k < m, we now want to 
define the k-relativization of F[m\, which is intended to encode the claim that that there exists a subset 
D C [m] of size \D\ > k such that the subformula I' D] C F\m] is satisfiable. We remark that a CNF 
formula encoding such a claim will be unsatisfiable when k is at least the unsatisfiability threshold of F. 

In order to express the existence of the subset D we use selectors s\, S 2 , ■ ■ ■, s m as indicators of 
membership in the subset and encode the constraint on the subset size \D\ = , s t > k as described 

in the next definition. 

Definition 4.4. The threshold-k formula for variables s = {si, ..., s rn } is the 3-CNF formula Thr^.(.?) 
that consists of the clauses 


Ve, 0 

£e [As], 

(4.3a) 

Ve,i -i v Pe,i v ye,i 

£ G [k] ,i £ [m] , 

(4.3b) 

Ve,m 

i £ [m], 

(4.3c) 

Pe,i v Pv,i 

£,£' £ [k\,£ ^ £',i £ [m], 

(4.3d) 

Pe,i v 

£ £ [k\,i £ [m] . 

(4.3e) 


To see that Thr/,.(T) indeed enforces a cardinality constraint, note that the variables pg^ encode a 
mapping between [k] and [m] (with pt j being true if and only if £ maps to i). The clauses (4.3a)-(4.3c) 
force every £ £ [&;] to have an image in [m], since they form the 3-CNF representation of clauses \J t Pe,i- 
The clauses (4.3d) forbid two distinct elements of [k\ to have the same image, so there must be at least k 
elements in the range of the map, and for each of them the corresponding selector must be true because 
of the clauses (4.3e). We will need the following properties of the threshold formula. 
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Observation 4.5. The formula Th r/.(.T) in Definition 4.4 has the following properties: 

1. Thrfc(.s) has size polynomial in both k and m. 

2. For any partial assignment to s with at least k ones there is an assignment to the extension vari¬ 
ables that satisfies Thr^(s). 


3. There is a resolution refutation of the set of clauses Thr^(^) U {\J ieD Si\D C [m], \D\ = A;} of 
size 0(/cm fc ) and width k + 1. 

Proof The first two items are immediate. In order to show the third item we can first derive each clause 


p l V.. Np k ik by resolving V... Vsi k with clauses of the form (4.3e), and then apply Lemma 2.7 . □ 

Using the formula in Definition 4.4 to encode cardinality constraints on subsets, we can now define 
formally what we mean by the relativization of a symmetric formula. 

Definition 4.6 (Relativization). Given a CNF formula F symmetric with respect to a domain [m] and a 
parameter k < m, the k-relativization (or k-relativizedformula) F[k ; m] is the formula consisting of 

1. the threshold formula Thr/, : (s) over selectors s = {si,..., s rn }; 

2. a selectable clause V ... V Sj s V C for each clause C £ F[m\, where {ii, ■ ■ ■, are the 
indices mentioned by C. 

Since we are dealing with refutations of unsatisfiable formulas, it will always be the case that the 
parameter k in Definition 4.6 is at least the unsatisfiability threshold of F. An important property of 
relativized formulas is that the hardness of F[k ; m] scales nicely with m. In particular, if F[k\ is not too 
hard, then the relativization F[k] m] also is not too hard. 

Proposition 4.7. If F[k\ has a resolution refutation of size S and width w, then F[k; m] has a resolution 
refutation of size S ■ + O (kmfi) and width w + k. 

Proof For every set D C [m] with \D\ = k we show how to derive 



(4.4) 


in size 5+1 and width w+k from F[k; m]. Without loss of generality (because of symmetry) we assume 
that D = [k\, so that we want to derive + V • • • V+.. Consider the assignment p = {si = 1,..., s k = 1}. 
In the restricted formula F\k ; m] the selectable clauses in Definition 4.6, item 2, with all indices in [k] 
become the clauses of F[k\, which has a refutation of size 5 and width w. Thus the clause si V • • • V s k 
can be derived in size 5 + 1 and width w + k from F[k; m] by Fact 2.5. After we have derived all clauses 
of the form (4.4) in this way, we can obtain the empty clause in width k + 1 and in size at most O ( km k ) 
using Observation 4.5. □ 

4.2 Random Restrictions and Size Lower Bounds 

To prove size lower bounds on refutations of relativized formulas F[k]m] we use random restrictions 
sampled as follows. 

Definition 4.8 (Random restrictions for relativized formulas). Given a relativized formula +[/,;: m\, 

we define a distribution IZ of partial assignments over the variables of this formula by the following 
process. 

1. Pick uniformly at random a set D C [m] of size k. 

2. Fix ^ to lift € D and to 0 otherwise. 
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3. Extend this to any assignment to the remaining variables of the formula Thirds) that satisfies this 
threshold formula. 

4. For every variable that has index i 0 I), fix x,j to 0 or 1 uniformly and independently at 
random. 

5. All remaining variables x t j for the indices i £ D are left unset. 

It is straightforward to verify that the distribution 7 Z is constructed in such a way as to give us back 
F[k] from F[k;m\. 

Observation 4.9. For any relativized formula F[k: m] and any p € 1Z it holds that F[k ; m] \ p is equal 
to F[k\ up to renaming of variables. 

The key technical ingredient in the size lower bound on sums-of-squares proofs is the following 
property of the distribution 1Z, which was proven in [AM013, ALN14] but is rephrased below using the 
notation and terminology in this paper. We also provide a brief proof sketch just to give the reader a 
sense of how the argument goes. 

Lemma 4.10 ( [AM013, ALN14] ). Let k. £, m be positive integers such that m > 16 and £ < k < 
m/(41ogm). Let M be a monomial over the variables of F[k',m] and let p be a random restriction 
sampled from the distribution 1Z in Definition 4.8. Then the domain-degree of M\ p is less than t with 
probability at least 1 — (4k\ogm) k /rr/. 

Proof sketch. Ley £' be the domain-degree of M. The restriction p will set independently and uniformly 
at random at least £' — k of its variables, so if (£' — k) is larger than £ log m, the restricted monomial 
M\ p is non zero with probability at most 1 /m . Otherwise we upper bound the probability that M\ p has 
domain-degree l with the probability that the £' indices in M contain t of the k surviving indices. By a 
union bound this probability is at most (4k log m) k /mf. □ 

Using Lemma 4.10, it is now straightforward to show that relativization amplifies degree lower 
bounds to size lower bounds. 

Theorem 4.11. Let k,£,m be positive integers such that m > 16 and £ < k < m/(41ogm). If 
the CNF formula F[k] requires sums-of-squares refutations of domain-degree t, then the relativized 
formula F[k ; m\ requires sums-of-squares refutations of size / (4k log m) k . 

Proof. Suppose that there is a sums-of-squares refutation of F[k] m] in size S, i.e., containing S mono¬ 
mials. For p sampled from 7Z, we see that the probability that some monomial in the refutation restricted 
by p has domain-degree at least £ is at most 


S.khPpL ( 4 . 5 ) 

m t 

by appealing to Lemma 4.10 and taking a union bound. 

As noted in Observation 4.9, the formula F[k; m] \ p is equal to F[k] up to renaming of variables, and 
so it cannot have a refutation of domain-degree £ or less. This implies that the bound on the probabil¬ 
ity (4.5) is greater than one, and thus we obtain 


S> 


£ 

m 

(4k\ogm) k 


(4.6) 


which proves the theorem. 


□ 
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4.3 Statement of Main Result and Discussion of Possible Improvements 

Putting everything together, we can establish the formal version of our main results in Theorem 1.1 as 
follows. 

Theorem 4.12. Let k = k(m ) be any monotone non-decreasing integer-valued function such that 
k(m ) < m/(41ogm). Then there is a family of 4-CNF formulas t with O (km 2 ) clauses 

over O (km) variables such that: 

1. Resolution can refute F m ^ in size k°^m k and width 2k + 1. 

2. Any sums-of-squares refutation of F m ] t requires size 0(m a ° k /{4k log where a o is a uni¬ 

versal constant. 

Proof Let G be a graph with properties as in Theorem 3.9 and let F[k] be the CNF formula /c-Clique(G) 
in Definition 3.1. Since F[k] is symmetric, we can relativize it as in Definition 4.6 to obtain F[k: m\, 
which will be our 4-CNF formula F m j... Theorem 3.9 says that F[k] has a resolution refutation of 
size k ()<k> and width k + 1, and appealing to Proposition 4.7 we get a resolution refutation of F m p. in 
size k°(O m k and width 2k + 1. Since we have a domain-degree lower bound of a-ok for refuting F[k] 
according to Theorem 3.9, we can use Theorem 4.11 to deduce that the required size to refute F rri k in 
sums-of-squares is at least Q(m a ° k /(4 k log m ) k ). The theorem follows. □ 

We remark that straightforward calculations show that when k{m) = 0(m s ) for 6 < o.o the upper 
bound in Theorem 4.12 is m°^ and the lower bound is m n ( k \ 

Let us now discuss a couple of the parameters in Theorem 4.12 and how they could be improved 
slightly. We stated our main theorem for 4-CNF formulas, since that is the clause size that results 
naturally from our construction. Flowever, if one wants to minimize the clause width and obtain an 
analogous result for 3-CNF formulas this is also possible to achieve, just as was done in [ALN14] for 
other proof systems. To prove a version of Theorem 4.12 for 3-CNF formulas we need a simple but 
rather ad-hoc variation of the relativization argument presented above. Let us briefly describe what 
modifications are needed. 

The way we presented the construction above, we started with the 3-CNF formula fc-Clique(G) and 
then applied relativization, which turned the clauses (3.1c)-(3.1e) into the 4-CNF formula 

Si V Zifl i £ [ k ], (4.7a) 

St V z V x itVj V Zij i £ [fc], j £ [27], (4.7b) 

Si V Zi, N i£[k], (4.7c) 

An alternative approach would be to first encode fc-Clique(G) with wide clauses VjrLi x i,vj instead of 

clauses of the form (3.1c)-(3.1e), relativize this new, wide formula, and then convert the relativized 
formula into 3-CNF using extension variables. Instead of clauses (4.7c)-(4.7c), this would yield the 
collection of clauses 


Si v Zifl 

i £ [k], 

(4.8a) 

z i,(j—1 ) V %i^Vj V Zij 

i £ [A;], j £ [27], 

(4.8b) 

%i,N 

i £ [k\. 

(4.8c) 


This causes a small technical problem in that some of these clauses mention i £ [m] but lack the 
literal s*, and so a random restriction sampled as in Definition 4.8 may actually falsify these clauses. The 
solution to this is to change the random assignment so that when s t = 0 , we fix each x r , v . uniformly at 
random in {0,1}, set each equal to the value assigned to Xi >Vj , and finally fix z^n to 0. The new 

restriction satisfies all clauses (4.8a)-(4.8c), and the proof of Lemma 4.10 still goes through. 

Another parameter in Theorem 4.12 that could be improved is the value of op, which determines 
how tightly the size lower bound matches the upper bound implied by width/degree and also how high 
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5 Concluding Remarks 


we can push k(m). In our reduction from a 3-XOR formula <j> to the clique formula fc-Clique(G^) we 
start by splitting the 8 n constraints into k blocks. The vertices in each block correspond to assignments 
to 24 n/k variables, and because of this an SOS refutation in domain-degree d of Clique (G^) can be 
converted to a refutation in degree 24 dn/k of 0. 

If we want to obtain a more efficient reduction, we could instead split the n variables, rather than the 
8 n constraints, into k parts. In this way each vertex in G/ would correspond to an assigment to n/k vari¬ 
ables, and an SOS refutation in domain-degree d would translate to a refutation of 0 in degree dn/k. But 
now we cannot reduce to the clique problem anymore. Splitting with respect to constraints allows us to 
enforce pairwise consistency between vertices in different blocks referring to common variables. When 
splitting with respect to variables, the vertices in different blocks correspond to partial assigments on 
disjoint domains and so are always pairw ise compatible. However, we must still require that these partial 
assignments are consistent with the constraints in 0. Each such constraint refers to up to three blocks. 
Thus, any satisfying assignment to 4> corresponds to k vertices such that no triple of vertices violates an 
3-XOR constraint. This reduces to the problem of finding a /c-hyperclique in a 3-uniform hypergraph. 
The rest of the reduction can be made to work as in Lemma 3.8. In the end we get an analogous result 
of that in Theorem 3.9 but with «o equal to a instead of which also improves Theorem 4.12. In this 
paper we instead presented a reduction to the /r-cliquc problem for standard graphs, partly because we 
believe that a degree lower bound for this problem can be considered to be of independent interest. 


5 Concluding Remarks 

In this paper, we show that using Lasserre semidefinite programming relaxations to find degree-d sums- 
of-squares proofs is optimal up to constant factors in the exponent of the running time. More precisely, 
we show that there are constant-width CNF formulas on n variables that are refutable in sums-of-squares 
in degree d but require proofs of size rr l - d k 

As for so many other results for the sums-of-squares proof system, in the end our proof boils 
down to a reduction from 3-XOR using Schoenebeck’s version [Sch08] of Grigoriev’s degree lower 
bound [GriOlb]. It would be very interesting to obtain other SOS degree lower bounds by different 
means than by reducing from Grigoriev’s results for 3-XOR and knapsack. 

Another interesting problem would be to prove average-case SOS degree lower bound for k-clique 
formulas over Erdos-Renyi random graphs, or size lower bounds for (non-relativized) fc-clique formulas 
over any graphs. In this context, it might be worth to point out that the problem of establishing proof size 
lower bounds for fc-clique formulas for constant k, which has been discussed, for instance, in [BGLR12], 
still remains open even for the resolution proof system (although lower bounds have been shown for 
tree-like resolution in [BGL13] and for full resolution for a version of clique formulas using a different 
encoding more amenable to lower bound techniques in [LPRT13]). 
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